Mercury BLAST dictionaries: analysis and performance measurement

ثبت نشده
چکیده

This report describes a hashing scheme for a dictionary of short bit strings. The scheme, which we call near-perfect hashing, was designed as part of the construction of Mercury BLAST, an FPGA-based accelerator for the BLAST family of biosequence comparison algorithms. Near-perfect hashing is a heuristic variant of the well-known displacement hashing approach to building perfect hash functions. It uses a family of hash functions composed from linear transformations on bit vectors and lookups in small precomputed tables, both of which are especially appropriate for implementation in ardware logic. We show empirically that for inputs derived from genomic DNA sequences, our scheme obtains a good tradeoff between the size of the hash table and the time required to ompute it from a set of input strings, while generating few or no collisions between keys in the table. One of the building blocks of our scheme is the H_3 family of hash functions, which are linear transformations on bit vectors. We show that the uniformity of hashing performed with randomly chosen linear transformations depends critically on their rank, and that randomly chosen transformations have a high probability of having the Type of Report: Other Department of Computer Science & Engineering Washington University in St. Louis Campus Box 1045 St. Louis, MO 63130 ph: (314) 935-6160 Mercury BLAST Dictionaries: Analysis and Performance Measurement Jeremy Buhler [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acceleration of ungapped extension in Mercury BLAST

The amount of biosequence data being produced each year is growing exponentially. Extracting useful information from this massive amount of data efficiently is becoming an increasingly difficult task. There are many available software tools that molecular biologists use for comparing genomic data. This paper focuses on accelerating the most widely used such tool, BLAST. Mercury BLAST takes a st...

متن کامل

Mercury BLASTN: Faster DNA Sequence Comparison using a Streaming Hardware Architecture

Motivation: Large-scale DNA sequence comparison, as implemented by BLAST and related algorithms, is one of the pillars of modern genomic analysis. One way to accelerate these computations is with a streaming architecture, in which processors are arranged in a pipeline that replicates the multistage structure of the algorithm. To achieve high performance, the processor hardware implementing the ...

متن کامل

ارائه یک مکانیزم مناسب جهت کاهش فراجهش سازه تحت بار انفجار

as in order to protection of a blast loaded structure, three ways can be utilized: (1) increasing the stand-off using natural or artificial barriers, (2) increasing the structure strength or performance level, (3) decreasing the energy of blast using the passive, semi active, active and hybrid control methods. In recent method, using controlling mechanism and tools, the effect of blast waves is...

متن کامل

Design and Evaluation of a BLAST Ungapped Extension Accelerator, Master's Thesis

The amount of biosequence data being produced each year is growing exponentially. Extracting useful information from this massive amount of data is becoming an increasingly difficult task. This thesis focuses on accelerating the most widely-used software tool for analyzing genomic data, BLAST. This thesis presents Mercury BLAST, a novel method for accelerating searches through massive DNA datab...

متن کامل

Effects of Partial Substitution of Styrene-butadiene-styrene with Granulated Blast-furnace Slag on the Strength Properties of Porous Asphalt

The present experimental research investigates the feasibility of partial substitution of styrene-butadiene-styrene (SBS) with ground granulated blast-furnace slag (GGBS) for the modification of bitumen and porous asphalt mixtures. The control asphalt mixture and the seven modified porous asphalt mixtures have been analyzed separately and their performance was compared. Modified bitumen and asp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016